Hierarchical Clustering for Semi-Supervised Ground Truth Generation
نویسندگان
چکیده
Supervised learning tasks can require a large collection of labeled data for accurate pattern recognition. For recognition of handwritten characters, manually producing ground truths can be very tedious. In this paper, we propose a semisupervised hierarchical clustering method to reduce the necessary amount of human effort required for labeling a dataset of handwritten characters. The experimental results demonstrate that the approach can improve labeling accuracy over baseline methods.
منابع مشابه
Comprehensive Cross-Hierarchy Cluster Agreement Evaluation
Hierarchical clustering represents a family of widely used clustering approaches that can organize objects into a hierarchy based on the similarity in objects’ feature values. One significant obstacle facing hierarchical clustering research today is the lack of general and robust evaluation methods. Existing works rely on a range of evaluation techniques including both internal (no ground-truth...
متن کاملActive semi-supervised fuzzy clustering
Clustering algorithms are increasingly employed for the categorization of image databases, in order to provide users with database overviews and make their access more effective. By including information provided by the user, the categorization process can produce results that come closer to user’s expectations. To make such a semi-supervised categorization approach acceptable for the user, thi...
متن کاملA Semi-Supervised Clustering Method Based on Graph Contraction and Spectral Graph Theory
Semi-supervised learning is a machine learning framework where learning from data is conducted by utilizing a small amount of labeled data as well as a large amount of unlabeled data (Chapelle et al., 2006). It has been intensively studied in data mining and machine learning communities recently. One of the reasons is that, it can alleviate the time-consuming effort to collect “ground truth” la...
متن کاملSemi-supervised Fuzzy Clustering Algorithms for Change Detection in Remote Sensing Images
For the problem of change detection it is difficult to have sufficient amount of ground truth information that is needed in supervised learning. On the contrary it is easy to identify a few labeled patterns by the experts. In this situation to avoid wastage of available information semi-supervision is suggestible to enhance the performance of unsupervised ones. Here we present the fuzzy cluster...
متن کاملSSHMT: Semi-supervised Hierarchical Merge Tree for Electron Microscopy Image Segmentation
Region-based methods have proven necessary for improving segmentation accuracy of neuronal structures in electron microscopy (EM) images. Most region-based segmentation methods use a scoring function to determine region merging. Such functions are usually learned with supervised algorithms that demand considerable ground truth data, which are costly to collect. We propose a semi-supervised appr...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015